Genomic signature: characterization and classification of species assessed by chaos game representation of sequences.

Authors

  • P J Deschavanne
  • A Giron
  • J Vilain
  • G Fagot
  • B Fertil
Abstract

We explored DNA structures of genomes by means of a new tool derived from the "chaotic dynamical systems" theory (the so-called chaos game representation [CGR]), which allows the depiction of frequencies of oligonucleotides in the form of images. Using CGR, we observe that subsequences of a genome exhibit the main characteristics of the whole genome, attesting to the validity of the genomic signature concept. Base concentrations, stretches (runs of complementary bases or purines/pyrimidines), and patches (over- or underexpressed words of various lengths) are the main factors explaining the variability observed among sequences. The distance between images may be considered a measure of phylogenetic proximity. Eukaryotes and prokaryotes can be identified merely on the basis of their DNA structures.
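As a rough illustration of how oligonucleotide frequencies can be laid out as a CGR image and how two such images can be compared, here is a minimal Python sketch. It is not the authors' implementation. The corner layout in `CORNERS`, the word length `k`, the function names `cgr_image` and `image_distance`, and the choice of Euclidean distance are all assumptions made for the example; the abstract does not specify any of them. Each k-letter word is mapped to the pixel that the chaos game iteration (moving halfway toward the corner of each successive base) would reach, computed here with integer bit operations rather than floating-point coordinates.

```python
from math import sqrt

# Assumed corner layout (x, y) for the four bases; a common convention,
# not one stated in the abstract.
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_image(seq, k=3):
    """Return a 2^k x 2^k grid of k-mer frequencies arranged as a CGR image.

    Words containing symbols other than A, C, G, T are skipped.
    """
    size = 1 << k
    counts = [[0] * size for _ in range(size)]
    total = 0
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if any(base not in CORNERS for base in word):
            continue
        col = row = 0
        # In the chaos game the most recent base decides the coarsest
        # quadrant, so the last base of the word is the most significant bit.
        for j, base in enumerate(word):
            cx, cy = CORNERS[base]
            col |= cx << j
            row |= cy << j
        counts[row][col] += 1
        total += 1
    if total == 0:
        return [[0.0] * size for _ in range(size)]
    return [[c / total for c in row_counts] for row_counts in counts]


def image_distance(a, b):
    """Euclidean distance between two images of the same size (an assumed metric)."""
    return sqrt(sum((p - q) ** 2
                    for row_a, row_b in zip(a, b)
                    for p, q in zip(row_a, row_b)))
```

For example, `image_distance(cgr_image(s1, k=4), cgr_image(s2, k=4))` gives one number summarising how differently two sequences `s1` and `s2` use 4-letter words; smaller values mean more similar images.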


Similar resources

Bayesian classification for promoter prediction in human DNA sequences

Many computational methods are already available for data retrieval and analysis of genomic sequences, but some functional sites are difficult to characterize. In this work, we examine the problem of promoter localization in human DNA sequences. Promoters are regulatory regions that govern the expression of genes, and their prediction is reputed to be difficult, so this issue is still open. We pres...


Self-Similarity Limits of Genomic Signatures

It is shown that metric representation of DNA sequences is one-to-one. By using the metric representation method, suppression of nucleotide strings in the DNA sequences is determined. For a DNA sequence, an optimal string length to display genomic signature in chaos game representation is obtained by eliminating effects of the finite sequence. The optimal string length is further shown as a sel...


Secondary Structural Analysis of Families of Protein Sequences using Chaos Game Representation

CGR is an effective method for visualizing any structural features if it is given as a sequence of elements [1,2] analyzed by the genomic signature appears as a powerful tool for investigating the mechanisms of DNA maintenance from which the DNA structure results. It would be necessary to understand the patterns they exhibit and to be able to interpret them in a biologically meaningful way [3]....


Genomic Signature Is Preserved in Short DNA Fragments

The recent availability of complete genomes opens a new field of research devoted to the general analysis of their global structure without regard to gene interpretation. The Chaos Game Representation of DNA sequence [2], when modified to allow for quantification, displays the whole set of frequencies of words found in a given genomic sequence under the form of images where the value of each pi...


The spectrum of genomic signatures: from dinucleotides to chaos game representation.

In the post genomic era, access to complete genome sequence data for numerous diverse species has opened multiple avenues for examining and comparing primary DNA sequence organization of entire genomes. Previously, the concept of a genomic signature was introduced with the observation of species-type specific Dinucleotide Relative Abundance Profiles (DRAPs); dinucleotides were identified as the...



Journal title:
  • Molecular biology and evolution

Volume 16, Issue 10

Pages  -

Publication date: 1999